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1.   VISUAL  PATTERN  RECOGNITION 

1.1    The  Nature  of  the  Problem 

Visual  pattern  recognition  has  interested,  researchers  since 
the  dawn  of  the  computer  era,  e.g.  [Dinneen,  1955],  no  doubt  due  to 
the  importance  of  vision  in  everyday  life.   Yet  the  performance  of 
the  human  visual  system  has  yet  to  be  matched  or  even  approximated 
by  artificial  means,  as  our  understanding  of  its  nature  is  poor  and 
the  complexity  of  the  problem  is  immense.   The  technique  of  inferring 
the  structure  of  three-dimensional  space  from  a  two-dimensional  projec- 
tion combined  with  certain  depth  cues  provided  by  binocular  vision 
appears  to  require  an  extensive  background  of  knowledge  about  the 
nature  of  space,  the  nature  of  objects  and  the  relationship  of  signals 
or  possible  signals  produced  in  the  nervous  system  by  the  different 
modalities  of  perception  [Tunstall,  1973].   This  process  takes  years 
to  master  even  for  human  children  [Hunt,  196l].   Furthermore,  the 
problem  to  which  human  vision  is  a  solution  is  "ill-formed"  [Uhr,  1973]. 

Therefore,  to  construct  a  visual  pattern  recognition  system 
it  is  necessary  at  the  present  time  to  restrict  the  problem  in  some 
way.   One  approach  is  to  limit  the  universe  in  which  the  system  operates 
so  that  the  knowledge  base  and  inductive  processes  required  are  reduced 
to  a  tractable  point  [Griffith,  1973].   Another  is  to  realize  that 
the  full  power  of  human  perception  is  unnecessary  in  many  applications, 
so  that  acceptable  results  in  one  particular  problem  area  may  be 
obtained  by  simpler  means.   Such  a  technique  was  used  in  the  design 


of  the  "smart  bombs"  used  by  the  United  States  during  the  Vietnam  War, 
which  located  military  targets  such  as  bridges  by  matching  a  simple 
template. 

Usually  a  combination  of  the  two  approaches  is  taken — a  useful 
problem  area  is  selected  and  a  system  is  designed  which  extracts  only 
such  information  as  is  relevant  to  the  problem.   Typical  problem  areas 
include  the  recognition  of  printed  or  handwritten  characters,  the  con- 
trol of  assembly  equipment  in  industrial  plants,  the  recognition  of 
aircraft  from  radar  signals,  and  the  analysis  and  classification  of 
X-ray  photographs,  bubble  chamber  photographs,  satellite  pictures,  etc. 
It  should  be  noted  that  most  of  these  problems  involve  only  two- 
dimensional  images,  since  a  difficulty  in  the  recognition  of  three- 
dimensional  objects  is  that  their  shape  may  vary  drastically  when  viewed 
from  different  angles ,  so  that  either  a  knowledge  of  the  complete 
structure  of  the  object  or  a  large  number  of  views  of  the  same  object 
must  be  stored  and  processed. 

The  amount  of  information  contained  in  a  visual  image  is  very 
high,  yet  the  amount  of  information  required  in  the  output  of  an  image 
recognition  system  is  typically  quite  low,  e.g.  36  alphanumeric  characters, 
Thus  the  problem  is  to  remove  information  that  is  redundant  or  unrelated 
to  the  problem  area,  while  preserving  relevant  information.   Of  course, 
processing  becomes  easier  as  more  information  is  discarded.   The  informa- 
tion that  is  retained  (often  referred  to  as  a  "feature  vector")  is  then 
compared  to  stored  information  about  the  image  classes  to  be  recognized 


using  statistical  techniques  of  varying  complexity,  and  a  decision 
made  about  the  classification  of  the  image  [Duda  and  Hart,  1973]. 

1.2    Existing  Solutions 

A  number  of  techniques  have  been  employed  for  discriminating 
between  useful  and  redundant  or  useless  information,  of  which  most 
fall  into  a  few  broad  classes:   template  matching,  pattern  sampling, 
spatial  transforms,  geometrical  moments  and  feature  extraction 
[El-Sonni,  1978]. 

The  most  straightforward  procedure  is  template  matching,  in 
which  the  feature  vector  consists  of  the  points  making  up  the  image. 
Images  are  then  compared  by  performing  a  correlation  between  the  input 
image  and  the  stored  reference  images.   Since  the  original  images  are 
highly  redundant,  this  method  requires  large  amounts  of  data  storage 
and  is  also  not  conducive  to  handling  variations  of  size,  rotation 
and  other  factors. 

Pattern  sampling  involves  constructing  a  feature  vector  by 
measuring  the  input  pattern  at  known  points  on  the  image  plane,  for 
example,  by  recording  the  number  of  points  in  a  fixed  area  or  finding 
the  intersection  of  the  image  and  lines  with  a  known  direction.   This 
approach  is  less  sensitive  to  certain  types  of  variation,  and  has  been 
used  successfully  in  letter  sorting  [Fairhurst  and  Stonham,  1976]. 

A  more  sophisticated  approach  is  to  transform  the  image  in  order 
to  express  it  as  a  combination  of  a  set  of  orthogonal  functions.   In 


itself  this  does  not  provide  any  data  reduction  since  the  number  of 
transform  coefficients  is  the  same  as  the  number  of  points  in  the 
original  image.   However,  the  coefficients  are  less  correlated  than 
the  original  image  points  so  that  a  selected  subset  may  retain  most 
of  the  information  content  of  the  original  image.   In  this  respect 
the  Karhunen-Loeve  transform  has  been  shown  to  be  optimal  [Watanabe, 
1965],  [Fukunaga  and  Koontz,  1970],  but  is  tedious  to  calculate. 
However,  Andrews  has  shown  that  the  Fourier  transform  provides  results 
which  are  nearly  as  good  [Andrews,  1971]-   Fourier  coefficients  also 
have  the  advantage  that  they  are,  in  theory,  insensitive  to  transla- 
tion of  the  image,  although  not  to  rotation  or  magnification. 

The  Walsh  or  Hadamard  transform  does  not  have  this  property, 
and  performs  slightly  less  well  than  the  Fourier  transform  in 
eliminating  correlation  between  coefficients,  but  is  often  used  since 
Walsh  functions,  being  binary  valued,  are  easier  to  handle  computationally 
than  the  basis  functions  of  the  Fourier  transform,  which  are  complex 
exponentials. 

The  Karhunen-Loeve,  Fourier  and  Walsh  transforms  are,  by  their 
nature,  sensitive  only  to  global  properties  of  the  image.   In  many 
cases,  however,  it  would  be  useful  to  retain  more  information  about 
the  local  properties  of  the  image.   The  Haar  transform  is  locally 
sensitive  to  some  extent,  [Andrews  and  .Caspari,  1970],  but  it  is  less 
widely  used  since  no  particular  advantages  have  been  gained  from  this 
feature. 

All  orthogonal  transforms  require  a  significant  amount  of  computa- 
tion to  determine  the  transform  coefficients,  and  present  the  problem 


of  how  to  choose  the  best  subset  of  the  coefficients.   They  have 
produced  acceptable  results  in  many  areas,  but  seem  to  show  the 
greatest  utility  in  applications  for  which  the  transform  coefficients 
are  closely  related  to  the  properties  of  interest  in  the  image,  such 
as  geography  or  geology  [Robinson,  1975]. 

The  method  of  geometrical  moments  involves  weighting  the  points 
of  the  input  image  with  known  weight  functions  and  summing  the  result. 
By  normalizing  the  calculation  of  the  higher-order  moments  by  the 
values  of  the  lower-order  moments,  the  results  can  be  made  invariant 
to  operations  such  as  magnification,  translation,  rotation  and  skew 
[Hu,  1962].   They  are  also  sensitive  to  local  structure.   However,  this 
method  requires  considerable  computation,  and  provides  good  results 
only  for  certain  types  of  straightforward  variation,  since  it  amounts 
to  calculating  the  statistical  properties  of  the  image.   (Mean, 
standard  deviation,  skew,  kurtosis,  regression  coefficients,  etc. 
are  particular  examples  of  moment  values.)   Lower  order  moments  are 
often  used  for  preprocessing  image  normalization.   However,  for  general 
purpose  recognition,  it  has  been  shown  that  better  results  can  be 
obtained  by  simpler  means  [Kaufman  and  Breeding,  1976]. 

The  most  sophisticated  method  involves  considering  the  image 
as  a  combination  of  meaningful  features.   The  technique  is  then  to 
extract  these  features  from  the  image  and  examine  their  relationship. 
In  terms  of  removing  redundancy  from  the  image,  and  insensitivity  to 
variation,  this  approach  is  the  best.   However,  it  is  also  the  most 
difficult  since  it  requires  the  designer  to  select  a  suitable  set  of 


features  and  then  to  devise  a  means  of  detecting  them.   As  human  pattern 
recognition  appears  to  operate  similarly  in  some  respects,  there  is 
a  lot  of  interest  in  this  field.   In  particular,  we  have  designed  por- 
tions of  our  own  environment  in  terms  of  the  relationship  of  features, 
so  that  for  pattern  recognition  efforts  in  these  areas  feature  extrac- 
tion seems  the  most  promising  technique.   A  good  example  of  this  is  in 
the  recognition  of  handwritten  characters,  where  two  similar  patterns, 
"0"  and  "Q" ,  are  distinguished  only  by  a  single  feature — the  tail  of 
the  "Q". 

1. 3    Summary 

In  addition  to  these  "broad  areas,  researchers  have  examined 
various  other  techniques  such  as  autocorrelation  [Horwitz  and  Shelton, 
196l],  slope  histograms  [Sklansky,  1971],  etc.,  without  achieving  any 
remarkable  improvement  in  results.   Therefore,  it  appears  that  in  the 
selection  of  an  approach  to  image  recognition,  the  choice  should  "be 
governed  by  the  nature  of  the  problem  area  and  the  ease  of  implemen- 
tation.  Research  performed  by  members  of  the  Information  Engineering 
Laboratory  has  also  demonstrated  that  the  combination  of  a  carefully 
chosen  application  and  a  hardware  recognition  method  tailored  to  match 
it  can  produce  excellent  results  at  a  low  system  cost.   Examples  of 
this  approach  are  the  recognition  of  line  drawings  [Blandford,  197*0 
and  the  extraction  of  the  component  parts  of  a  landscape  picture 
[Simons,  1971]. 


2.   THE  OLREC  SYSTEM 

2. 1    Overview 

The  OLREC  ( On- Line  RECognition)  system,  proposed  by 
Dr.  W.J.  Poppelbaum  [Poppelbaum  and  Kubitz,  1978],  performs  image 
recognition  in  real  time  regardless  of  any  translation,  magnifica- 
tion or  rotation  of  the  image.   All  processing  is  performed  by  hard- 
ware and  is  completed  within  one  second.   The  amount  of  hardware 
required  is  small  enough  that  the  system  is  easily  portable.   The 
application  area  involves  only  two-dimensional  images  which  are  for 
the  most  part  continuous,  and  further  assumes  a  well-defined 
separation  between  the  figure  and  the  ground.   Images  of  this  type 
are  typical  of  the  field  of  industrial  robotics,  and  may  also  be 
encountered  in  other  areas  such  as  synthetic-aperture  radar  pictures 
of  ships  [Smith  and  Wright,  1971].   Conversations  with  the  research 
staffs  of  RCA  and  International  Harvester  have  confirmed  the  conten- 
tion that  two-dimensional  images  provide  sufficient  information  for 
the  recognition  of  three-dimensional  objects  in  an  industrial  applica- 
tion, since  experience  shows  that  the  projections  of  an  object  such 
as  a  machine  part  in  a  stable  orientation  tend  to  converge  to  a  small 
number  of  possibilities  (typically  no  more  than  three). 

The  image  is  detected  by  a  conventional  closed-circuit  televi- 
sion camera,  which  uses  on-line  feedback  control  of  a  zoom  lens  and 
pan-and-tilt  unit  to  normalize  the  size  and  position  of  the  image. 
The  image  is  then  sampled  along  radii  leading  outward  from  the  center 


of  the  picture,  by  performing  a  transformation  from  rectangular  to 
polar  coordinates  on-line  over  the  course  of  a  single  video  frame. 
This  simulates  the  operation  of  a  radial-scan  television  camera  at 
a  much  lover  cost  than  the  construction  of  a  custom  unit.   After 
video-slicing,  the  radial  image  is  stored  in  random-access  memory 
(RAM).   This  portion  of  the  system  is  described  further  by  its 
designer,  Randy  Moss  [Moss,  1978]. 

The  remainder  of  the  system,  which  is  described  in  detail  below, 
takes  the  radial  image  stored  in  RAM  and  compares  it  to  an  arbitrary 
number  of  stored  prototypes  in  parallel,  using  the  comparison  results 
to  generate  a  classification  of  the  image.   The  remaining  portion  of 
this  chapter  outlines  the  techniques  used.   It  is  assumed  that  the 
front  end  of  the  system  has  normalized  the  image,  so  that  the  problems 
of  translation  and  magnification  may  be  ignored.   The  recognition 
process  considers  rotation  to  be  the  only  possible  variation  of  the 
image.   The  effects  when  this  assumption  is  violated  are  discussed  in 
the  following  chapter. 

Surprisingly,  the  use  of  radial  images  has  seldom  been  proposed — 
one  rare  example  is  [Singer,  196l].   While  polar  coordinates  make 
certain  types  of  processing  such  as  stroke  following  more  awkward 
[Uhr,  1973],  they  provide  a  significant  advantage  in  other  cases, 
since  a  rotation  of  the  original  image  simply  produces  a  one-dimensional 
shift  in  the  stored  version.   The  utility  of  this  property  has  been 
amply  demonstrated  when  it  occurs  in  other  types  of  processing 
[Kaufman  and  Breeding,  1976],  [Sklansky,  1971]. 


2.2    The  Profile  Method 

Rather  than  simply  using  the  original  radial  image,  further 
advantages  in  ease  of  comparison  and  data  reduction  can  be  obtained 
by  using  a  profile  method,  as  demonstrated  by  the  Information  Engineering 
Laboratory's  NORMAN  project  [Huberts,  1975]-   This  technique  involves 
summing  the  image  points  in  a  given  direction,  and  may  be  thought  of 
as  producing  a  normalized  version  of  the  image  by  shifting  each  slice 
of  the  sampled  image  perpendicularly  to  an  axis  until  it  touches  the 
axis,  as  shown  in  Figure  2.1.   It  may  also  be  considered  as  a  projection 
of  zeroth-order  geometrical  moments.   Obviously,  the  profile  of  an  image 
is  not  unique.   In  fact,  one  disadvantage  of  the  profile  technique  as 
used  in  NORMAN  was  its  failure  to  distinguish  between  certain  objects 
which  we  would  like  to  consider  different,  as  in  the  examples  of 
Figure  2.2. 

However,  this  shortcoming  may  be  ameliorated.   The  main  problem 
lies  in  the  fact  that  the  profile  was  constructed  along  lines  which 
are  parallel  in  the  image  plane,  while  in  the  type  of  objects  which 
are  commonly  encountered  in  our  environment  we  usually  consider  skewing 
or  bending  in  a  single  direction  to  be  a  significant  feature  in 
distinguishing  between  two  classes  of  objects.   By  producing  a  profile 
of  a  radial  image,  this  type  of  confusion  is  avoided.   Also,  by 
producing  two  different  profiles  corresponding  to  integration  over 
the  range  of  the  two  variables  which  define  the  image  plane  ( in  this 
case,  r  and  8),  a  second  safeguard  is  provided. 
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Figure  2.1:      Profile  Method 
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Figure  2.2:   Two  Objects  Which  Cannot  Be  Distinguished  by  NORMAN 
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Figures  2.3  through  2.11  illustrate  the  technique  for  a 
simple  example.   The  original  image  of  Figure  2.3,  which  lies  com- 
pletely within  the  scanning  circle,  is  shown  in  Figure  2.k   sampled 
radially  with  €>h   points  on  a  radius  and  Q*   radii  (roughly  one  sample 
every  five  degrees).   Each  data  point  in  the  radial  scan  is 
represented  by  an  arc.   Figure  2.5  shows  how  the  radial  scan  would 
be  stored  in  memory.   The  radii  here  are  arranged  parallel  to  each 
other,  with  r  on  the  horizontal  axis  and  9   on  the  vertical  axis, 
rather  than  in  the  orientation  used  in  sampling  the  original  image. 
Therefore,  the  image  undergoes  a  "fan-like"  distortion,  so  that  the 
center  point  appears  along  the  6   axis.   However,  the  distortion  is 
the  same  for  all  stored  objects,  so  it  does  not  have  any  effect  when 
objects  are  compared. 

Figure  2.6  shows  a  histogram  of  the  values  obtained  by  inte- 
grating the  radial  scan  over  r — the  radial  profile.   The  circular 
profile  (Figure  2.7)  results  from  an  integration  over  8. 

A  clockwise  rotation  of  the  image  (Figure  2.8)  produces  a 
downward  (and  end-around)  shift  in  the  radial  scan  data,  as  shown 
in  Figure  2.9.   A  similar  shift  occurs  in  the  radial  profile  (Figure 
2.10),  while  the  circular  profile  (Figure  2.1l)  remains  unchanged 
except  for  some  sampling  noise  caused  by  not  rotating  by  an  integral 
number  of  angular  sampling  intervals. 
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Figure  2.3:   Original  Image 
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Figure  2.k:      Radial  Scan  of  Image 
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Figure  2. 5:      Radial  Scan   Data 
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Figure  2.6:   Radial  Profile 
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Figure  2.7:   Circular  Profile 
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Figure  2.8:   Radial  Scan  of  Rotated  Image 
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Figure  2.9:   Radial  Scan  Data  for  Rotated  Image 
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Figure  2.10:   Radial  Profile  of  Rotated  Image 
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Figure  2.11:   Circular  Profile  of  Rotated  Image 
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2. 3    Profiles  of  Radial  Images 

It  is  instructive  to  consider  the  operation  of  summing  an 
image  along  radii  of  constant  angular  spacing  and  around  circles 
of  constant  radius.   Although  different  images  may  produce  the  same 
results,  the  intermapping  is  by  no  means  as  obvious  as  in  the 
rectangular  example  of  Figure  2.2. 

In  the  first  case,  images  that  are  mapped  into  the  same  profile 
are  those  for  which  all  or  part  of  the  image  has  been  shifted  along 
the  radii,  for  example  Figure  2.12.   There  are  more  possibilities  for 
nondiscrimination  among  figures  that  occupy  only  a  few  points  of  the 
image  plane  than  among  those  which  have  a  large  area,  as  there  are 
more  shifting  possibilities  when  there  are  fewer  points.   Furthermore, 
figures  which  are  related  in  this  way  are  uncommon  in  most  applications. 

In  circular  summing,  figures  that  are  confounded  are  those  which 
have  all  or  part  of  the  image  shifted  around  a  circle  (Figure  2.13). 
Of  course,  this  includes  figures  that  are  rotated  versions  of  each 
other.   Again,  there  are  more  possibilities  for  nondiscrimination  among 
sparse  figures  than  for  figures  with  large  areas — for  example,  each 
circular  area  about  the  origin  has  a  unique  profile. 

The  two  types  of  nondiscrimination  are  quite  dissimilar. 
Therefore,  the  probability  of  two  objects  of  interest  being  non- 
discriminable  in  both  ways  simultaneously  is  low.   Moreover,  most  of 
the  patterns  which  are  confused  are  relatively  sparse  and  discoritinous , 
whereas  the  type  of  objects  we  hope  to  recognize  is  typically  large 
and  structured. 
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Figure  2.12:   Two  Objects  with  Radially  Nondiscriminable  Profiles 
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Figure  2.13:   Two  Obj 
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The  profile  method  is  also  more  tolerant  of  noise  than  the 
use  of  the  original  image.   Making  the  conventional  assumption  that 
noise  has  a  mean  value  of  zero,  any  summing  operation  will  tend  to 
remove  noise.   Also,  noise  that  is  not  removed  will  simply  result 
in  small  variations  in  the  edge  of  the  profile,  which  can  be  handled 
by  providing  some  tolerance  in  the  comparison  of  profiles. 

2.k  Classification  Method 

Classification  of  the  input  image  is  performed  using  the  nearest 
neighbor  rule — that  is,  the  input  image  is  identified  with  the  reference 
image  which  is  found  to  be  the  most  similar  to  it  by  virtue  of  having 
the  lowest  distance  measure  between  it  and  the  input  over  all  the 
reference  images.   The  nearest  neighbor  rule  was  chosen  because,  although 
it  is  one  of  the  simplest  classification  techniques  to  implement,  its 
performance  under  favorable  conditions  asymptotically  approaches  that 
of  the  theoretically  optimum  Bayes  classifier,  and  in  the  worst  case 
the  error  rate  is  never  worse  than  twice  that  of  Bayes*  rule.   That  is, 
in  an  infinite  data  set  at  least  half  of  the  classification  information 
resides  in  the  nearest  neighbor  [Duda  and  Hart,  1973]. 

It  is  necessary  to  define  a  measure  of  the  success  of  matching 
two  images.   An  image  is  represented  as  two  profiles,  each  of  which 
may  be  thought  of  as  a  vector  of  n  elements.   (The  value  of  n  is 
not  necessarily  the  same  for  the  two  types  of  profile.)   Each  element 
is  an  integer  representing  the  number  of  image  points  in  the  circle  or 
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radius.   Thus  an  element  by  element  subtraction  of  two  profile  vectors 
from  the  input  and  reference  images  will  produce  a  difference  vector, 
also  with  length  n. 

Since  the  objective  is  to  reduce  the  comparison  information  to 
a  single  value,  the  first  step  should  be  to  collapse  the  difference 
vector  into  one  observation  measuring  the  success  of  the  comparison  of 
the  two  profiles.   It  is  known  that  the  theoretically  optimum  method  to 
do  this  for  data  described  by  a  Gaussian  distribution  is  to  calculate 
the  Euclidean  distance  between  the  two  vectors  in  n-space  [Carl  and 
Hall,  1972].   However,  this  is  not  an  attractive  approach,  since  it 
requires  the  provision  of  complicated  arithmetic  operations  in  special- 
purpose  hardware,  with  adverse  effects  on  the  complexity  of  the  design, 
the  storage  requirements  and  the  speed  of  operation.   Moreover,  it 
is  contrary  to  the  design  philosophy,  expressed  in  Sections  1.3  and  2.1, 
of  minimizing  the  system  cost  by  the  use  of  simple  but  adequate 
techniques. 

Instead,  a  much  more  straightforward  approach  was  adopted,  and 
its  performance  verified  experimentally.   A  threshold  value  is  defined, 
and  an  element  of  the  difference  vector  is  said  to  represent  a  successful 
match  if  its  absolute  value  is  less  than  or  equal  to  the  threshold. 
The  success  of  the  comparison  of  the  two  vectors  is  judged  by  the 
number  of  successful  element  matches.   The  value  of  the  threshold 
should  be  sufficient  to  exclude  the  effects  of  sampling  noise  and 
inaccuracies  in  normalization,  without  admitting  spurious  matches.   For 
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n  =  6k ,  a  threshold  value  of  k    was  shown  experimentally  to  perform 
successfully. 

Since  the  circular  profile  is  invariant  to  rotation,  only  one 
vector  comparison  is  required.   However,  for  the  radial  profile  the 
effect  of  rotation  must  he  taken  in  account.   Therefore,  a  comparison 
is  performed  for  each  angular  position  of  the  input  object.   After  each 
vector  comparison,  the  input  profile  is  shifted  hy  one  element  and 
the  comparison  process  repeated.   The  best  match  result  over  all 
rotations  is  retained — if  the  two  objects  are  in  fact  the  same,  it  will 
represent  the  case  in  which  the  images  are  aligned.   If  desired,  the 
angular  value  may  be  retained  also. 

To  complete  the  process,  the  two  values  representing  the  number 
of  successful  element  matches  for  the  radial  and  circular  profiles  must 
be  combined  into  a  single  measure.   Since  calculating  the  Euclidean 
distance  between  two-dimensional  vectors  would  complicate  the  hardware 
design,  a  linear  combination  of  the  two  observations  seems  more 
attractive.   The  discussion  in  Section  2.3  indicates  that  the  comparisons 
of  the  radial  and  circular  profiles  tend  to  act  in  a  complementary 
manner — nondiscrimination  in  one  case  is  made  up  for  by  the  other. 
Therefore,  it  seems  reasonable  to  choose  an  equal  weighting.   In  hardware 
terms,  this  may  be  accomplished  simply  by  adding  the  two  values 
together. 

The  design  of  the  comparison  process,  described  above,  has  employed 
non-optimal  techniques  in  order  to  simplify  the  hardware  design  and 
to  increase  the  speed  of  operation.   It  must  be  demonstrated  that  these 
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compromises  have  no  adverse  effects  on  the  accuracy  of  operation  of 
the  system — that  the  qualitative  arguments  given  are  in  fact  correct. 
This  question  is  discussed  in  the  following  chapter. 

2. 5    Comparison  to  Current  Approaches 

Since  the  type  of  processing  to  be  done  by  the  system  is  similar 
to  that  required  in  many  industrial  robotics  applications,  it  would  be 
useful  to  compare  it  to  vision  systems  developed  in  this  area 
[Perkins,  1977],  [Saraga  and  Skoyles,  1976]. 

Typically,  the  industrial  robot  is  required  to  recognize  only 
a  small  number  of  objects  which  appear  at  a  known  distance  from  a 
fixed  camera.   Thus,  the  image  is  formed  in  two  dimensions  only.   Also, 
the  visual  environment  is  carefully  controlled  to  avoid  producing 
misleading  shadows  and  by  providing  a  constrasting  background 
[Agin,  1975],  [Tsuboi,  Tsuda ,  Shiraishi  and  Kosaka,  1973]. 

The  type  of  equipment  commonly  used  might  consist  of  a  television 
camera  (often  with  less  resolution  than  standard  video)  and  a  dedicated 
small  minicomputer.   The  tasks  performed  by  the  system  are  usually 
relatively  simple — reorientation,  packing,  simple  assembly  or 
rudimentary  inspection.   The  use  of  visual  processing  is  often 
limited  due  to  its  slow  speed. 

A  good  example  of  a  current  system  is  CONSIGHT-I,  developed 
at  General  Motors  Research  Laboratories  [Holland,  Rossol  and  Ward, 
1978].   A  linear  array  of  128  photodiodes  senses  the  image  of  an 
object  such  as  a  foundry  casting  as  it  passes  underneath  on  a  conveyor 
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belt.   A  clever  lighting  scheme  provides  separation  of  figure  and 
ground,  and  removes  the  possibility  of  misleading  shadows.   A  PDP  11 A5 
minicomputer  is  used  to  correlate  the  information  from  the  camera 
with  the  speed  and  position  of  the  conveyor  belt.   The  computer  is 
also  programmed  to  calculate  the  centroid  and  orientation  of  the  part, 
and  to  use  this  information  to  control  a  robot  arm  which  picks  up  the 
part  from  the  moving  belt.   A  considerable  amount  of  bookkeeping  is 
required  since  information  is  gathered  "on  the  fly",  and  it  is  also 
necessary  to  handle  multiple  objects  in  the  field  of  view  (which  are 
not  permitted  to  touch).   Only  one  type  of  part  is  expected,  but  the 
system  may  be  reprogrammed  by  passing  a  different  part  beneath  the 
camera  and  recording  its  characteristics.   The  processing  for  each 
part  is  reported  to  be  quite  slow  (about  20  to  30  seconds).   However, 
in  this  application  the  critical  restriction  is  the  slow  speed  of 
contemporary  robot  arms.   Thus  it  is  of  little  use  to  increase  the 
software  speed  beyond  this  point.   New  developments  in  robot  technology 
are  expected  to  approach  more  closely  the  typical  factory  speed  of 
one  part  per  second. 

A  system  such  as  OLREC  can  provide  several  advantages  for  this 
type  of  application.   It  can  handle  different  types  of  objects,  such 
as  unrelated  parts  which  are  cast  together  for  more  efficient 
utilization  of  the  volume  of  the  mold.   Also,  the  structure  of 
the  camera  system  enables  it  to  recognize  objects  at  varying  distances 
from  the  camera  and  in  different  locations  and  orientations.   Finally, 
it  demonstrates  that  by  the  use  of  a  parallel  hardware  implementation, 
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recognition  can  be  accomplished  very  rapidly,  and  at  a  reasonable 
cost.   The  entire  cost  of  OLREC  is  an  order  of  magnitude  less  than 
the  retail  price  of  a  PDP  11/^+5  such  as  the  one  used  by  GM.   The 
high  price  of  current  vision  systems  has  been  cited  as  the  reason 
for  their  lack  of  commercial  availability  [Allan,  1979]. 

It  should  be  noted  that  while  the  discussion  in  this  chapter 
has  focussed  solely  on  video  images,  the  method  and  implementation 
are  independent  of  the  means  used  to  acquire  the  image.   In  different 
applications  it  might  be  more  useful  to  use  other  techniques  such 
as  infrared  light,  radar  or  laser  rangef inding. 
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3.   SIMULATION  RESULTS 

3.1  The  Advantages  of  Simulation 

An  increasingly  common  view  in  the  methodology  of  system  design 
is  that  simulation  is  not  a  side  issue  in  the  design  effort,  but  rather 
an  integral  part  of  it.   That  is,  the  simulation  is  the  design.   This 
is  especially  true  for  hardware,  since  it  is  much  easier  to  experiment 
by  editing  a  program  than  by  rewiring.   Of  course,  simulation  results 
are  valid  only  up  to  a  point — no  hardware  can  be  said  to  be  verified 
until  it  is  actually  constructed.   The  value  of  simulation  lies  in  the 
separation  of  basic  conceptual  questions  from  implementation  details. 
This  makes  less  likely  that  backtracking  to  make  up  for  oversights  will 
be  necessary  in  the  later  stages  of  the  design. 

In  this  particular  example,  the  simulation  provides  several 
valuable  functions.   As  well  as  its  use  in  structuring  the  design 
process,  it  also  makes  possible  tightly  controlled  experiments  which 
cannot  be  done  with  the  hardware.   Since  these  experiments  may  precede 
the  hardware  design,  the  results  can  also  be  used  to  guide  the  choice 
of  particular  design  values.   Finally,  since  it  implements  the  recogni- 
tion algorithm,  it  can  be  used  in  the  comparison  of  hardware  and 
software  approaches  to  pattern  recognition. 

3.2  Description  of  the  Simulation 

The  simulation  is  written  in  PL/I  and  was  run  on  the  IBM  360/75 
system  at  the  University  of  Illinois.   It  is  slightly  over  one  thousand 
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lines  long,  comprising  about  seven  hundred  PL/I  statements  plus 
comments.   The  simulation  implements  the  recognition  algorithm 
described  in  the  previous  chapter — it  does  not  attempt  to  describe 
the  hardware.   To  provide  flexibility,  it  is  command-structured. 

The  user  can  construct  objects  from  three  basic  elements: 
rectangles,  right-angled  triangles  and  circles.   These  may  be  either 
black  or  white.   White  objects  erase  the  portions  of  previously 
defined  black  objects  which  they  overlap,  and  vice  versa.   Although 
all  conceivable  objects  are  not  easily  constructed  from  these  elements, 
a  sufficiently  rich  variety  for  experimental  purposes  is  readily 
available.   The  most  complicated  object  used  in  the  experiments 
required  only  nineteen  basic  elements. 

The  simulation  provides  the  ability  to  sample  the  defined 
image  radially  at  varying  angles,  to  construct  and  store  the  profiles, 
to  perform  comparisons  and  to  produce  graphical  output ,  which  is 
available  either  on  the  line  printer  (for  quick  turnaround)  or  the 
Calcomp  plotter  (for  more  visually  appealing  results).   Inaccuracies 
may  be  introduced  into  the  recognition  by  adding  noise,  shrinking  or 
expanding  the  image  or  translating  it  in  the  field  of  view.   Shrinkage 
and  expansion  are  accomplished  by  shifting  along  the  radii.   Therefore, 
some  distortion  is  produced  when  large  changes  are  made.   A  complete 
description  of  the  command  set  of  the  simulation  program  is  given  in 
Appendix  1. 

A  test  set  of  ten  objects  was  used  for  the  simulation  experiments, 
These  objects,  shown  in  Appendix  2,  are  stylized  images  representing 
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manufacturing  processes  which  were  developed,  for  industry  by  graphic 
artists  [Dreyfuss,  1972].   They  were  selected  not  only  "because  they 
are  the  type  of  objects  OLREC  was  intended  to  recognize  (as  described 
in  Section  2.1),  but  also  because  they  were  designed  for  easy  recogni- 
tion by  people  rather  than  by  machines.   Thus  they  provide  a  good 
comparison  with  more  sophisticated  feature-oriented  recognition 
techniques.   In  addition,  they  remove  the  possibility  of  unconscious 
bias  which  might  result  if  the  test  objects  were  created  specifically 
for  the  purposes  of  the  experiment. 

3.3    Experimental  Results 

The  OLREC  system  is  not  designed  to  recognize  arbitrary  objects. 
Its  performance  is  tailored  to  the  use  of  certain  properties  of  "nice" 
objects — that  is,  the  type  of  objects  we  expect  to  encounter  in  the 
real  world.   Since  a  rigorous  definition  of  "niceness"  appears  to  be 
impossible,  a  mathematical  analysis  of  the  method  would  be  of  little 
use.   Therefore,  performance  analysis  must  be  carried  out  experimentally. 

The  areas  of  investigation  to  be  examined  by  the  experiments 
are  the  performance  of  the  recognition  scheme  under  ideal  conditions, 
and  the  effects  of  noise  and  errors  in  centering  and  size  normalization. 

For  each  run  of  the  simulation,  each  of  the  original  ten  objects 
(the  training  set)  is  compared  to  the  ten  objects  of  the  altered  set 
(test  set).   Thus,  one  hundred  comparisons  are  performed,  ten  of  which 
are  correct  and  ninety  incorrect.   For  each  comparison  the  match  value 
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is  given  for  the  radial  match,  the  circular  match  and  the  combination 
of  the  two. 

The  point  of  interest  is  to  determine  whether  the  correct  match 
can  be  distinguished  from  the  incorrect  matches  solely  by  examination 
of  the  match  value.   Therefore  the  highest  and  lowest  match  values  are 
recorded  as  well  as  the  mean  and  standard  deviation  of  the  whole  set. 
The  necessity  to  reduce  the  large  amount  of  raw  data  has  introduced 
some  slight  conservative  bias  into  the  results,  since  although  the 
highest  incorrect  match  value  may  be  greater  than  the  lowest  correct 
value,  these  may  occur  for  different  objects  of  the  training  set,  each 
of  which  is  matched  correctly.   However,  the  basic  trends  are  clear. 

The  experiments  used  6k   radii  of  6k   points,  and  a  comparison 
threshold  of  k9   as  does  the  hardware.   These  values  were  selected  because 
previous  tests  showed  them  to  provide  acceptable  performance. 

Table  3.1  shows  the  performance  of  the  pattern-matching  method 
under  ideal  conditions.   The  test  set  is  derived  from  the  training  set 
by  rotation  by  a  non-integral  number  of  angular  sampling  intervals. 
Three  values  are  given  for  each  observation  in  the  table.   These  are 
respectively  the  radial,  circular  and  combined  values.   The  match  value 
is  expressed  as  a  percentage  of  the  best  possible  value. 

The  table  shows  that  the  method  operates  well.   Objects  matched 
to  themselves  give  a  perfect  match,  and  the  value  is  also  high  for  the 
match  to  the  rotated  version  of  the  same  object.   The  incorrect  matches 
have  much  lower  values,  and  there  is  a  "guard  band"  between  the  two 
sets  of  values.   Correct  and  incorrect  matches  are  clearly  distinguishable, 
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To  avoid  burdening  the  reader  with  a  mass  of  numbers,  the 
remainder  of  the  observations  in  the  form  of  Table  3.1  have  been 
relegated  to  Appendix  3.   Further  discussion  will  describe  the  results 
of  the  combined  match  value  only. 

Figure  3.1  shows  the  effect  of  introducing  noise  into  the  test 
set.   The  probability  that  a  bit  in  the  radial  image  is  inverted  is  given 
on  the  horizontal  axis.   The  solid  lines  show  the  means  of  the  correct 
and  incorrect  match  values.   The  dotted  lines  indicate  the  range  of 
the  observations.   As  predicted,  resistance  to  noise  is  good.   The  point 
where  the  lowest  correct  match  value  equals  the  highest  incorrect  value 
(that  is,  where  the  first  possibility  of  an  erroneous  conclusion  appears) 
is  with  a  noise  probability  equal  to  approximately  0.12 — about  one  bit 
in  eight  inverted.   Performance  continues  to  deteriorate,  with  the 
probability  of  an  erroneous  conclusion  becoming  steadily  greater,  until 
with  a  noise  probability  of  0.1+  the  correct  and  incorrect  matches  are 
almost  indistinguishable.   The  highest  match  value  now  contains  hardly 
any  information  about  the  correctness  of  the  match.   (Also,  the  test 
set  contains  hardly  any  information  about  the  shape  of  the  objects.) 

Examination  of  the  original  data  observations  shows  that  on  the 
whole  the  distribution  of  values  is  very  close  to  Gaussian.   The  ranges 
would  be  much  closer  to  the  mean  were  it  not  for  a  small  number  of 
outliers  (less  than  five  percent  of  the  total)  which  lie  at  the  critical 
edges  of  the  range  (the  low  edge  of  the  correct  matches  and  the  high 
edge  of  the  incorrect  matches).   It  seems  likely  that  these  depend  more 
on  the  actual  images  involved  than  the  mean  value.   Because  of  the 
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presence  of  these  outliers,  the  distributions  are  slightly  skewed  in 
the  opposite  direction.   If  the  outliers  were  ignored,  however,  the 
performance  would  be  somewhat  better.   This  suggests  than  in  situations 
where  an  incorrect  match  can  be  tolerated,  operation  may  continue  fairly 
reliably  for  some  distance  outside  the  ranges  of  safety  shown  in  this 
section. 

An  examination  of  the  data  also  shows  that,  as  predicted,  the 
combination  of  the  two  matching  types  performs  much  better  than  either 
one  by  itself.   For  example,  with  a  noise  probability  of  0.2,  the 
lowest  correct  circular  match  is  6.25$,  but  the  lowest  combined  value 
is  2k. 2.%.      The  outlier  problem  is  somewhat  reduced  by  this  effect. 

To  investigate  the  effects  of  comparing  objects  of  different 
sizes  (such  as  might  result  from  inaccuracies  in  the  normalization 
technique),  a  magnified  version  of  the  training  set  was  used  as  the 
test  set.   Figure  3.2  shows  the  results.   No  possibility  of  error  occurs 
until  the  objects  have  been  expanded  by  10$.   After  that  point,  the 
probability  of  error  increases  steadily,  as  in  the  previous  case. 
The  highest  incorrect  match  values  tend  to  increase  slightly  as 
significant  portions  of  the  objects  expand  outside  the  field  of  view, 
since  all  objects  begin  to  look  increasingly  the  same.   However,  the 
mean  incorrect  match  value  remains  approximately  constant.   The  particulars 
of  the  individual  data  observations  are  similar  to  those  described 
above. 

The  remaining  question  to  be  examined  is  the  effect  of  inaccuracies 
in  the  centering  method.   Figure  3.3  shows  the  results  of  translating 
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the  test  set.   The  horizontal  axis  is  the  distance  shifted,  as  a 
percentage  of  the  field  of  view.   The  first  possibility  of  error 
occurs  after  the  object  has  been  moved  by  13$.   The  results  are  quite 
similar  to  the  two  previous  cases. 

Examination  of  the  three  graphs  shows  an  interesting  phenomenon — 
as  conditions  get  worse,  the  correct  match  values  become  smaller,  but 
the  incorrect  values  remain  more  or  less  the  same.   This  suggests  that 
there  is  a  "natural  level"  for  the  match  value  of  unrelated  objects, 
having  a  mean  of  about  25%   and  a  maximum  value  of  about  50% .   As 
conditions  deteriorate,  the  correct  match  value  becomes  increasingly 
close  to  the  random  match  level.   Thus  the  magnitude  of  the  match  value 
is  a  reliable  indicator  of  the  probability  that  the  match  is  the  correct 
one.   A  50%   match  is  a  convenient  cutoff  point  above  which  it  is  almost 
certain  that  the  right  object  has  been  recognized. 

3 . k  Performance  of  So f tware  Recognition 

Since  the  simulation  program  implements  the  recognition  algorithm 
in  a  direct  manner,  it  may  be  used  as  a  standard  with  which  to  compare 
the  performance  of  the  hardware  system.   In  any  conventional  computing 
system,  processing  must  be  carried  out  sequentially.   Therefore,  the 
time  required  to  recognize  an  object  is  directly  proportional  to  the 
number  of  stored  objects.   OLREC  matches  all  objects  in  parallel,  so 
the  time  required  is  a  constant  dependent  on  the  clock  rate  of  the  system. 
As  stated  above,  a  recognition  time  of  one  second  was  stated  as  the 
design  goal. 
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The  operating  system  on  the  University  of  Illinois  IBM  360/75 
provides  an  entry  point  which  returns  the  number  of  centiseconds 
remaining  in  the  job  step.   Using  this  facility,  the  simulation 
calculates  the  raw  CPU  time  consumed  in  the  execution  of  each  command. 
The  simulation  runs  which  produced  the  data  described  in  the  previous 
section  performed  U 35  matches  at  a  time — all  possible  pairwise  combinations 
of  30  objects.   The  overhead  involved  in  performing  and  timing  the 
matches  is  only  a  small  percentage  of  the  total  time.   The  time  taken 
should  therefore  be  an  accurate  representation  of  the  CPU  time  used 
in  matching  operations. 

From  this  information,  the  average  time  needed  to  perform  a 
single  comparison  between  two  objects  was  calculated  to  be  0.26  seconds. 
(Each  run  took  about  six  minutes  of  CPU  time,  and  the  turnaround  time 
was  between  one  and  two  hours.)   This  indicates  that  OLREC  would 
provide  faster  recognition  than  such  a  dedicated  large  mainframe  for 
any  number  of  objects  greater  than  four.   Considering  the  difference 
in  cost  between  the  two  alternatives,  it  appears  that  the  use  of 
dedicated  hardware  is  superior  in  such  an  application. 

3. 5    Conclusions 

The  simulation  studies  of  the  recognition  algorithm  indicate 
that  excellent  performance  may  be  expected  despite  inaccuracies  of 
up  to  10%   in  the  image  of  the  object  to  be  identified,  and  that 
performance  deteriorates  in  a  reasonably  graceful  manner  beyond  this 
point.   Furthermore,  the  accuracy  of  the  identification  is  reliably 
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indicated  by  the  magnitude  of  the  match  value.   The  hardware 
implementation  has  also  been  demonstrated  to  be  faster  and  more 
cost-effective  than  recognition  performed  by  software. 
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k.      HARDWARE  DESIGN 

h. 1    Overall  Description 

This  chapter  contains  a  description  of  the  hardware  implementation 
of  the  recognition  algorithm  given  in  Chapter  2.   This  hardware  makes 
up  the  profile  storage  and  recognition  portion  of  the  OLREC  system. 
It  receives  its  input  from  the  image  acquisition  system,  or  "intelligent 
camera",  designed  by  Randy  Moss  [Moss,  1978]. 

The  system  was  designed  in  a  modular  fashion,  in  order  to  divide 
the  design  problem  into  smaller,  independent  sections.   This  approach 
is  particularly  evident  in  the  control  logic,  which  consists  of  four 
control  sections  which  operate  under  the  supervision  of  a  single  high- 
level  controller.   The  goal  of  this  philosophy  is  to  simplify  the 
design  process  "by  a  technique  of  "divide  and  conquer"  similar  to  the 
"structured  programming"  approach  popular  in  the  software  world.   While 
the  amount  of  hardware  used  may  possibly  be  increased  slightly  by  this 
type  of  design,  the  ease  of  implementation  gained  is  well  worth  the 
price. 

The  first  step  is  to  define  the  tasks  which  are  to  be  performed. 
They  are: 

To  accept  the  input  image  from  the  image  acquistion  portion  of 
the  system  and  store  it. 

To  construct  the  radial  and  circular  profiles  of  the  input  images. 

To  compare  these  input  profiles  to  the  stored  profiles. 
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To  choose  the  best  match  over  the  stored  profiles  and  display 
the  result. 

To  store  the  input  profiles,  if  desired. 


Figure  k.l   shows  that  these  tasks  may  be  structured  as  a  rather  trivial 
state  machine.   Each  state  of  the  machine  corresponds  to  a  single  task. 
The  state  of  the  machine  indicates  what  task  is  currently  being 
performed.   Unlabelled  state  transitions  take  place  when  a  task  is 
completed.   There  are  two  control  signals,  START  and  STORE,  which 
initiate  the  comparison  and  storage  processes  respectively. 

Since  the  construction  of  the  profiles  of  the  input  image  takes 
place  only  in  the  comparison  loop,  it  is  impossible  to  store  input 
profiles  without  first  performing  a  comparison.   Since  the  comparison 
requires  less  than  one  second,  this  is  not  a  great  inconvenience.   It 
is  easy  to  imagine,  with  the  addition  of  a  trifling  amount  of  logic, 
a  self-teaching  system  which  activates  STORE  whenever  a  low  magnitude 
of  match  value  indicates  that  a  previously  unencountered  object  has 
been  discovered.   However,  the  present  system  leaves  this  decision  up 
to  the  operator. 

As  is  customary,  the  states  are  assigned  numerical  values: 

State  0  -  Ready  for  input 
State  1  -  Construct  profiles 
State  2  -  Perform  comparison 
State  3  -  Choose  best  match  value 
State  h   -  Store  input  profiles 


STORE 


Figure  ^.1:   State  Machine  System  Description 


hi 


Figure  k.2  shows  a  block  diagram  of  the  system  architecture, 
which  is  oriented  around  three  buses.  Control  signals  are  shown  as 
single  lines. 

The  S-bus  ("scan  bus")  has  12  address  lines  and  one  data  line. 
This  bus  is  used  for  the  transfer  of  radial  scan  information.   In 
state  0,  the  system  idles  with  the  intelligent  camera  controlling  the 
S-bus  and  the  read/write  line  for  the  scan  memory,  which  holds  i+096  bits— 
6k   radii  of  6k   elements.   The  camera  system  stores  a  radial  image  in 
the  scan  memory,  and  then  signals  to  the  state  controller  via  the 
START  line. 

The  state  controller  places  the  system  in  state  1,  giving  control 
of  the  S-bus  and  P-bus  to  the  state  1  control  logic.   The  P-bus 
("profile  bus")  has  8  address  and  8  data  lines.   The  state  1  control 
reads  data  from  the  scan  memory  and  constructs  the  profiles  as  two 
vectors  of  6k   elements,  which  are  stored  in  the  profile  memory.   Since 
the  maximum  value  of  a  profile  element  is  6k ,  only  seven  data  bits 
are  really  necessary,  but  it  is  convenient  to  provide  eight  since 
integrated  circuit  packages  are  usually  designed  in  multiples  of  four 
bits. 

The  system  then  progresses  to  state  2.   Since  the  S-bus  is  no 
longer  needed,  control  of  it  may  be  returned  to  the  camera  system.   Thus, 
matching  and  image  acquisition  may  be  overlapped.   The"  state  2  control 
reads  the  input  profile  from  the  profile  memory  and  broadcasts  it 
to  the  pattern  units  on  the  M-bus  ("match  bus").   The  M-bus  also  has 
8  address  and  8  data  lines.   The  address  lines  specify  memory  elements, 
not  pattern  units. 
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Each  memory  unit  contains  a  previously  stored  pattern,  which 
it  compares  to  the  input  profile.   The  rotation  of  the  image  is 
accomplished  by  shifting  the  relationship  of  the  addresses  on  the 
P-bus  and  the  M-bus.   The  pattern  units  store  the  highest  match  value 
produced  over  all  shifts.   The  number  of  pattern  units  active  (the 
number  of  stored  patterns)  is  set  from  the  front  panel. 

The  state  3  control  addresses  each  pattern  unit  in  turn  via  the 
read  control  lines  of  the  S-bus  (RD).   When  it  is  addressed,  the 
pattern  unit  places  its  match  value  on  the  data  lines  of  the  M-bus. 
Since  a  perfect  match  value  is  128,  all  8  data  lines  are  necessary 
on  this  bus.   Note  that  there  is  no  provision  made  for  reading  out  the 
contents  of  the  pattern  unit  memory — as  far  as  the  system  is  concerned, 
it  is  a  write-only  memory. 

The  state  3  control  stores  the  best  match  value  and  the  number 
of  the  corresponding  pattern  unit.   This  information  is  also  displayed 
on  the  front  panel. 

To  store  a  pattern,  a  pattern  unit  is  addressed  from  the  front 
panel  and  the  STORE  line  is  activated.   The  state  k   control  reads  out 
the  input  pattern  from  the  profile  memory  and  uses  the  M-bus  data 
and  address  lines  to  store  it  in  the  memory  of  the  specified  pattern 
unit. 

In  addition  to  the  signals  shown,  there  is  also  a  RESET  line 
activated  from  the  front  panel.   RESET  always  brings  the  system  back 
to  state  0. 

The  remainder  of  this  chapter  will  consist  of  a  detailed  descrip- 
tion of  each  section  of  the  block  diagram. 
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h.2  State  Controller 

Figure  k.3   shows  the  state  controller  circuit.   For  clarity, 
the  system  clock  and  connections  to  front  panel  LED  drivers  have  been 
omitted  from  the  circuit  diagram. 

Although  the  system  is  fully  synchronous,  certain  techniques 
associated  with  asynchronous  circuits  have  been  employed  to  enhance 
modularity.   Each  state  control  subsystem  is  activated  by  its 
corresponding  STATE  line  being  set  high.   When  it  has  completed  its 
operation,  it  raises  its  DONE  line  to  indicate  to  the  state  controller 
that  the  next  state  may  be  activated.   When  the  STATE  line  goes  low, 
it  clears  the  DONE  flag  again.   Therefore  the  state  controller  is 
totally  independent  of  the  operation  of  any  of  the  state  control 
subsystems . 

The  two  loops  of  control  flow  shown  in  Figure  k.l   are  implemented 
as  shift  registers.   Each  shift  register  contains  a  single  one-bit 
whose  position  indicates  the  current  state.   The  operation  of  the 
shift  register  is  determined  by  two  control  bits,  Sn  and  S  .   When 
both  are  low,  shifting  is  inhibited.   When  both  are  high,  the  register 
is  loaded  from  the  parallel  inputs.   When  S_   only  is  high,  right 
shifting  is  performed,  and  left  shifting  when   S   only  is  high. 

If  the  RESET  line  is  forced  low,  all  of  the  NAND  gates  feeding 
the  S   and  S   inputs  have  a  high  output.   This  causes  the  shift 
registers  to  be  loaded  with  the  state  0  data  configuration:   the  left 
register  (for  the  compare  loop)  with  1000,  and  the  right  register 
(for  the  store  loop)  with  0010. 
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In  this  state,  either  the  START  or  STORE  inputs  may  be  raised 
to  commence  either  control  loop.   START  is  available  either  as  an 
external  signal  from  the  camera  system  or  from  a  front  panel  control 
for  testing  purposes.   START  and  STORE  are  enabled  only  when  the  system 
is  in  state  0. 

Activating  START  will  shift  the  contents  of  the  left  register 
right  one  position,  as  the  S-   input  of  that  register  will  become  high. 
The  right  register  remains  in  its  inactive  state.   As  soon  as  the 
shift  occurs,  S   will  become  low  again,  since   Q   is  now  low.   The 
system  is  in  state  lj  the  STATE1  line  is  high.   When  D0NE1  becomes  high, 
another  right  shift  will  take  place,  and  STATE2  will  become  active. 
Similarly,  D0NE2  will  initiate  state  3.   On  D0NE3  becoming  high,  the 
register  is  reloaded  to  its  original  configuration  by  forcing  both 
Sn   and   S   to  become  high. 

In  the  same  manner,  STORE  activates  state  h  by  shifting  the 
contents  of  the  left  register  right  one  position.  When  DONE^  goes 
high,  the  register  is  reloaded  to  return  to  state  0. 

Except  for  RESET,  all  input  signals  are  enabled  only  when 
the  system  is  in  a  state  in  which  it  makes  sense  to  accept  them. 
Thus,  for  example,  the  value  of  D0NE1  is  ignored  except  in  state  1. 
This  means  that  actions  occur  on  the  first  clock  pulse  after  the 
transition  of  the  signal,  and  the  length  of  time  that  the  signal 
level  persists  is  immaterial.   In  particular,  if  a  state  control  logic 
board  is  removed  during  testing,  its  DONE  line  will  float  high,  and 
the  corresponding  state  will  last  only  one  clock  period. 
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Since  the  STATE  signals  arbitrate  access  to  the  system  buses, 
it  is  extremely  undesirable  that  more  than  one  STATE  signal  should 
be  high  at  the  same  time.   This  situation  might  occur  when  power  is 
first  applied  to  the  system,  since  the  contents  of  the  registers  will 
be  unpredictable.   Therefore  an  RC  power-on  clear  is  provided  to 
ensure  that  the  system  will  come  up  in  a  safe  state — all  zeros.   No 
input  can  be  accepted,  but  activating  RESET  will  place  the  system  in 
state  0  ready  for  operation. 

k.3  State  1  Control 

The  processing  performed  in  state  1  involves  the  construction 
of  radial  and  circular  profiles  from  the  radial  scan  input  stored  in 
the  scan  memory.   Figure  h.k   shows  the  control  logic  required.   For 
clarity,  bus  drivers  and  system  clock  connections  have  been  omitted. 

The  radial  scan  image  is  Gh   by  64  bits  in  size.   Thus  12  bits 
are  necessary  to  address  a  single  bit  in  the  image.   The  high  order 
six  bits  address  the  row,  or  radius.   The  low  order  six  bits  address 
the  column,  or  point  on  the  radius.   Each  profile  contains  6h    elements 
of  eight  bits,  requiring  only  a  six-bit  address. 

To  construct  the  radial  profile,  a  twelve-bit  counter  is  used 
to  step  through  the  scan  memory.   The  contents  of  the  counter  are 
placed  on  the  S-bus  address  lines  ( SAQ  through  SA   ) .   This  counter 
is  formed  from  three  four-bit  counters  in  the  upper  left  corner  of 
Figure  k.k.      The  high  six  bits  of  the  counter  address  the  profile 
element,  so  they  are  also  placed  on  the  P-bus  address  lines 
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(PA«  through  PA^).   The  counters  in  the  circuit  are  held  in  a  cleared 
state  while  STATE1  is  low.   When  STATE1  goes  high,  they  begin  to  count 
and  operation  begins. 

If  the  value  on  the  S-bus  data  line  (termed  S)  is  high,  a 
second  counter  which  contains  the  profile  element  value  is  incremented. 
This  counter  is  shown  in  the  lower  right  corner  of  the  diagram.   The  con- 
tents of  this  counter  are  placed  on  the  P-bus  data  lines  (P   through  P_) 
After  each  element  has  been  constructed,  the  profile  memory  write 
enable  line  (PWE)  is  activated  to  store  the  value,  and  the  element 
value  counter  is  cleared  for  the  next  element. 

The  radial  profile  requires  a  row  by  row  summing  of  the  scan 
memory.   To  construct  the  circular  profile,  a  column  by  column  summing 
is  necessary.   The  twelve-bit  counter  goes  through  another  cycle, 
but  the  row  and  column  addresses  on  the  S-bus  are  interchanged  by  means 
of  a  multiplexor.   This  multiplexor  is  controlled  by  a  thirteenth 
counter  bit,  which  is  also  used  as  PA/-.   Thus  the  profile  memory, 
which  must  be  128  by  8  bits  in  size,  contains  the  radial  profile 
in  the  lower  half  of  its  address  space,  and  the  circular  profile  in 
the  upper  half.   Note  that   PA   is  not  used.   It  is  trivial  to  provide 
it  for  possible  future  expansion  since  the  integrated  circuits  used 
are  constructed  in  multiples  of  four  bits. 

Once  the  second  counter  cycle  is  completed,  the  fourteenth 
counter  bit  becomes  high.   This  bit  is  used  as  the  D0NE1  signal.  On 
the  next  clock  cycle  the  state  controller  will  respond  by  setting 
STATE1  low,  thereby  clearing  all  the  counters.   The  extra  clock  cycle 
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has  no  effect,  since  PWE  is  inactive  on  this  cycle. 

STATE1  is  also  used  to  control  the  access  of  the  state  1  control 
circuit  to  the  S-bus  and  P-bus  by  enabling  tristate  bus  drivers  on 
the  address  and  data  lines. 

k.k  Pattern  Unit 

The  pattern  units  perform  the  functions  of  storing  the  profiles 
and  comparing  them  to  the  input  profile.   Figure  ^4.5  shows  a  block 
diagram  of  a  single  pattern  unit. 

A  128  x  8  bit  RAM  is  used  to  hold  the  two  stored  profiles, 
radial  and  circular.   It  receives  its  data  and  address  from  the  M-bus , 
when  WE   (Write  Enable)  is  active.   Each  pattern  unit  has  its  own 
unique  WE  line,  as  is  described  in  Section  ^.8.   The  reason  for 
using  this  method,  rather  than  selecting  a  pattern  unit  by  decoding 
an  address,  is  that  there  is  no  space  on  the  pattern  unit  board  for 
decoding  logic.   It  was  thought  preferable  to  keep  the  pattern  unit 
logic  on  a  single  board  and  to  do  the  address  decoding  on  the  state  k 
control  board. 

The  comparison  is  performed  by  sending  an  address  to  the  memory 
on  the  M-bus  address  lines,  and  the  input  profile  element  to  the  com- 
parison logic  on  the  M-bus  data  lines.   By  applying  the  appropriate 
clocking  signals  to  the  comparison  logic,  a  comparison  of  the  value 
on  the  data  lines  and  the  output  of  the  memory  is  performed. 

The  comparison  logic  latches  its  inputs  at  the  beginning  of  the 
comparison,  so  that  comparison  time  and  memory  access  time  may  be 
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overlapped,  thereby  making  the  processing  speed  essentially  independent 
of  the  memory  speed.   When  the  comparison  process  is  complete,  the 
match  value  latch  contains  the  sum  of  the  best  radial  match  and  the 
circular  match. 

When  RD  (Read)  becomes  active,  the  match  value  is  placed  on 
the  M-bus  data  lines.   Each  pattern  unit  has  its  own  unique  RD  line, 
just  as  it  does   WE. 

The  first  step  in  the  comparison  process  described  in  Chapter  2 
is  to  calculate  the  absolute  value  of  the  difference  of  two  profile 
elements.   Figure  k.6   shows  the  method  employed.   The  start  signal 
initializes  the  circuit  by  loading  the  down  counters  with  the  two 
numbers  to  be  compared,  and  clearing  the  flipflop  and  the  up  counter. 
When  the  clock  signal  is  applied,  the  two  counters  are  counted  down. 
When  the  value  in  one  counter  is  about  to  become  zero,  its  borrow 
output  becomes  one,  and  the  flipflop  is  set  on  the  next  clock  pulse. 
The  borrow  output  of  the  other  counter  clears  the  flipflop.   The  up 
counter  counts  only  when  the  flipflop  is  set,  so  at  the  end  of  the 
process  it  will  contain  the  number  of  clock  pulses  between  the  two 
borrows,  i.e.  the  absolute  differences  of  the  two  counter  values.   If 
the  values  were  the  same,  the  flipflop  will  never  be  set  because  of 
the  action  of  the  exclusive  OR  gate.   Hence,  the  up  counter  value 
will  remain  zero.   The  number  of  clock  periods  required  to  ensure  a 
correct  answer  in  every  case  is  one  greater  than  the  maximum  value 
that  could  be  loaded  in  the  down  counters. 

It  would  be  possible  to  generate  the  absolute  difference  of 
two  numbers  in  a  shorter  time  by  subtracting  the  two  numbers  and 
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negating  the  result  or  reversing  the  inputs  if  the  result  were  negative. 
However,  this  approach  was  not  used  for  the  following  reasons.   First, 
it  would  require  more  logic,  which  would  preclude  fitting  a  pattern 
unit  on  a  single  board.   Since  the  down  counters  also  serve  as  latches, 
the  approach  of  Figure  k,6   requires  only  three  additional  elements 
to  compute  the  absolute  difference.   Also,  an  arithmetic  logic  unit  such 
as  the  7^l8l,  with  which  to  perform  the  subtraction,  requires  a  2k 
pin  package,  whereas  the  boards  used  provide  space  only  for  l6  pin 
packages.   Finally,  the  count  down  technique  does  not  take  so  much  time 
that  the  design  goal  of  one  second  would  be  exceeded,  so  it  is 
unnecessary  to  speed  up  the  operation  any  further. 

Figure  U.7  shows  the  pattern  unit  logic,  and  Figure  k.8  shows 
the  relationship  between  the  comparison  timing  signals  which  control 
it,  over  a  single  element  comparison  cycle. 

The  top  two  rows  of  counters  function  as  in  Figure  ^.6.   The 
next  row  of  counters  make  up  the  "hits  counter",  which  records  the 
number  of  profile  element  pairs  for  a  given  rotation  of  the  input 
image  whose  absolute  difference  is  within  the  tolerance  threshold. 
The  bottom  row  of  counters  make  up  the  "match  value  counter"  in  which 
the  ultimate  match  value  is  produced. 

To  begin  a  comparison,  it  is  necessary  to  clear  the  match  value 
counter.   This  is  done  by  pulsing  the  MRESET  signal  low.   At  the  same 
time,  the  addresses  on  the  M-bus  and  P-bus  are  set  up  to  ensure  that 
the  first  pair  of  data  values  will  be  present  when  required.   For  the 
time  being,  the  CIRC  signal  is  set  low. 
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For  the  first  element  of  a  rotation,  the  hits  counter,  which 
has  a  synchronous  clear  input,  must  "be  cleared.   Therefore,  C5  is 
taken  low  in  conjunction  with  C3,  as  shown  in  Figure  k.Q.      Next, 
C^  is  pulsed  low  to  load  the  down  counters  and  clear  the  flipflop 
and  the  second  counter.   Since  the  clear  inputs  act  synchronously, 
the  clock  signal  to  these  devices,  C2 ,  must  be  started  at  this  time. 
The  down  counters  have  an  asynchronous  load  and  do  not  require  a 
clock  pulse.   Finally,  the  count  down  clock,  CI,  is  started.   Since 
the  maximum  value  of  a  profile  element  is  6k,    CI  must  continue  for 
at  least  65  clock  cycles.   However,  as  soon  as  the  down  counters 
are  loaded,  the  address  lines  may  be  updated  to  set  up  the  next  pair 
of  data. 

When  the  absolute  difference  of  the  profile  elements  has 
been  calculated,  it  is  compared  to  the  threshold  value,  which  is  set 
by  switches  which  are  common  to  all  pattern  units.   On  recommencing 
the  comparison  cycle,  the  pulse  on  C3  will  increment  the  hits  counter 
if  the  difference  is  less  than  the  threshold.   C5  will  remain  high 
at  this  time. 

At  the  end  of  a  rotation,  two  more  clock  cycles  are  needed. 
The  first  is  a  pulse  on  C3  to  allow  the  last  possible  increment  of 
the  hits  counter.   Then  c6,  the  clock  signal  for  the  match  value 
counter,  is  pulsed.   If  the  value  in  the  hits  counter  is  not  less 
than  the  value  in  the  match  value  counter,  it  will  be  loaded  into 
the  match  value  counter.   Therefore,  the  match  value  counter  always 
contains  the  best  radial  match  value. 
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Once  all  6k   possible  rotations  of  the  input  image  have  been 
examined,  the  circular  profiles  must  also  be  compared,  and  the  match 
value  obtained  and  added  to  the  best  radial  match.   The  most  straight- 
forward way  to  do  this  is  to  increment  the  match  value  counter 
instead  of  the  hits  counter.   CIRC  is  set  high,  causing  the  multi- 
plexors it  controls  to  configure  the  match  value  counter  for  incrementing 
instead  of  loading.   C3  is  turned  off,  and  for  the  next  6h   element 
comparisons  C6  acts  as  C3  did  during  the  radial  comparisons.   At  the 
end  of  this  the  final  result  is  contained  in  the  match  value  counter. 

As  described  above,  this  value  may  be  placed  on  the  M-bus  data 
lines  by  lowering  RD,  and  the  memory  may  be  loaded  by  lowering  WE. 

k.5  State  2  Control 

The  purpose  of  the  state  2  control  logic  is  to  provide  the 
pattern  unit  timing  signals  shown  in  Figure  k.8   and  the  correct 
addresses  for  the  profile  and  pattern  unit  memories.   Figure  k.9 
shows  the  logic  required.   For  clarity,  system  clock  connections  to 
clocked  devices  are  again  omitted. 

When  STATE2  is  low,  all  devices  are  held  clear.  Thus,  when 
STATE2  goes  high,  the  first  pair  of  addresses  are  ready  to  be  set  up 
on  the  P-bus  and  the  M-bus.  At  the  same  time,  the  flipflop  in  the 
lower  lefthand  corner  of  the  diagram  is  set,  making  MRESET  low  and 
clearing  the  match  value  counters.  The  flipflop  also  provides  a  low 
pulse  on  CI  through  c6,  to  clear  the  other  counters  in  the  pattern 
units.   After  sixteen  clock  cycles,  enough  time  to  allow  the  memories 
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to  respond,  the  counter  clears  this  flipflop  and  loads  the  shift 
register  to  begin  normal  operation. 

Control  of  the  timing  signals,  CI  through  C6,  is  based  on  a 
shift  register  in  which  a  single  one-bit  is  shifted,  gating  the  low 
portion  of  the  system  clock  signal  on  to  the  timing  lines  as  required. 

In  the  initial  position  of  the  shift  register,  with  the  one-bit 
on  the  lefthand  side,  C3  is  set  low  as  the  system  clock  goes  low. 
If  the  M-bus  address  is  all  zeroes,  indicating  the  first  element  of 
a  rotation,  C5  will  be  set  low  also.   On  the  next  clock  cycle,  the 
register  is  shifted,  setting  C^  low  and  allowing  C2  to  start.   Also 
at  this  time  a  counter  is  loaded  with  the  value  190. 

On  the  next  clock  cycle,  as  the  shift  register  shifts  again, 
this  counter  begins  counting,  and  the  shift  register  is  not*  altered 
until  the  counter  overflows  at  the  value  256.   By  this  means,  the 
correct  number  of  CI  pulses  is  measured,  for  in  this  state  the  CI 
signal  is  enabled  also. 

Since  the  pattern  units  have  now  latched  the  data  values,  the 
memory  addresses  may  be  updated  to  allow  plenty  of  time  for  memory 
access.   Accordingly,  the  counter  which  contains  the  M-bus  address 
is  incremented  on  the  first  CI  cycle,  and  the  counter  for  the  P-bus 
address  on  the  second  CI  cycle.   D  flipflops  are  used  to  delay  the 
shift  register  output. 

Normally,  at  this  point  the  shift  register  is  reloaded  and 
the  cycle  repeats.   However,  at  the  end  of  a  rotation  some  further 
action  is  necessary.   This  condition  is  detected  by  the  M-bus  address 


61 


becoming  all  zeroes,  which  occurs  on  the  first  CI  cycle  of  the  last 
profile  element  of  the  previous  rotation.   The  M-bus  address  counter 
is  extended  in  order  to  count  the  6k   possible  rotations  of  the  input 
image.   These  six  bits  are  loaded  into  the  P-bus  address  counter  on 
the  second  CI  cycle  at  the  start  of  a  new  rotation.   This  serves  to 
advance  the  first  P-bus  address  of  a  rotation  by  one  on  each  successive 
rotation,  providing  thereby  an  effective  rotation  of  the  input  image 
by  one  angular  sampling  interval  (approximately  5.6  degrees). 

After  the  CI  cycles  for  this  case  have  concluded,  the  shift 
register  is  not  immediately  reloaded  as  before.   Instead,  two  additional 
shifts  are  performed,  which  serve  to  pulse  C3  and  C6 ,  as  shown  in 
Figure  k.8.      After  these,  the  register  is  reloaded  to  perform  the 
first  comparison  cycle  of  the  new  rotation. 

After  the  last  radial  profile  elements  have  been  compared,  the 
circular  profile  elements  are  compared.   This  is  indicated  by  the 
highest  bit  in  the  M-bus  address  counter   (R,)   becoming  high.   Since 
the  address  is  set  up  ahead  of  time,  the  indication  is  premature, 
and  must  be  clocked  into  a  flipflop  at  the  beginning  of  the  first 
circular  comparison  cycle.   This  flipflop  provides  the  CIRC  signal 
and  also  disables  C3,  routing  this  signal  instead  to  c6,  as  described 
in  the  previous  section. 

The  action  on  the  final  circular  comparison  is  slightly 
different  from  the  rest.   After  the  CI  cycles  are  complete,  the  shift 
register  is  reloaded  (thereby  providing  a  final  C6  pulse  to  clock 
the  match  value  counters)  and,  at  the  same  time,  the  flipflop  which 
provides  the  D0NE2  signal  is  set. 
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For  all  the  time  that  STATE2  is  high,  tristate  buffers  connecting 
the  data  lines  of  the  P-bus  and  the  M-bus  are  enabled.   This  allows 
the  output  of  the  profile  memory  to  be  provided  to  the  pattern  units 
over  the  M-bus. 

k.6  State  3  Control 

State  3  is  used  to  obtain,  store  and  display  the  best  match  value 
from  all  the  active  pattern  units.   Figure  ^.10  shows  a  circuit  diagram 
of  the  state  3  control  logic. 

The  validity  of  the  stored  information  is  indicated  by  a  "match 
valid"  flipflop  whose  output  appears  on  a  front  panel  LED.   This  flipflop 
is  set  by  D0NE3 ,  and  is  cleared  by  STATE1  (that  is,  when  a  new  comparison 
is  begun)  or  by  RESET. 

A  counter  is  held  clear  when  STATE3  is  low.   When  STATE3  goes 
high,  the  counter  begins  to  count  up.   The  counter  value  is  decoded 
to  give  RD  for  the  pattern  units.   Thus,  as  the  count  increases,  each 
pattern  unit  is  addressed  successively  by  its  RD  going  low,  and  it 
places  its  match  value  on  the  M-bus  data  lines.   The  counter  value 
is  compared  to  the  front  panel  switch  setting  which  indicates  the 
number  of  active  pattern  units.   When  the  two  are  equal,  the  count 
stops  and  the  D0NE3  flipflop  is  set. 

An  eight-bit  latch,  cleared  by  STATE1  or  RESET,  is  used  to  hold 
the  highest  match  value.   The  value  in  the  latch  is  compared  to  the 
value  on  the  M-bus  data  lines.   If  the  incoming  data  value  is  higher, 
it  is  loaded  into  the  latch.   At  the  same  time,  a  four-bit  latch  is 
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loaded  from  the  counter  with  the  number  of  the  corresponding  pattern 
unit.   Thus,  at  the  end  of  state  3,  the  highest  match  value  and 
the  number  of  the  corresponding  pattern  unit  will  be  stored  in  these 
latches.   This  information  is  displayed  on  LEDs  on  the  front  panel. 

k.f  State  k   Control 

In  state  k ,   the  input  profiles  are  stored  in  one  of  the  pattern 
unit  memories.   Since  the  input  profiles  are  constructed  during  state  1, 
a  comparison  step  must  always  precede  storage.   The  state  h   control 
logic,  shown  in  Figure  b.ll,  is  quite  straightforward. 

A  counter  is  held  clear  when  STATED  is  low.   When  STATED  goes 
high,  the  counter  begins  counting  up.   The  counter  value  is  placed  on 
the  address  lines  of  both  the  P-bus  and  the  M-bus ,  and  the  data  lines 
of  the  P-bus  and  M-bus  are  connected  together  by  enabling  a  tristate 
buffer.   In  this  manner  the  contents  of  the  profile  memory  are  read 
out  step  by  step  and  placed  on  the  M-bus  data  lines. 

The  pattern  unit  in  which  the  profiles  are  to  be  stored  is 
addressed  from  switches  on  the  front  panel.   When  STATED  goes  high 
the  switch  value  is  decoded  and  the  WE  line  for  the  addressed  pattern 
unit  goes  low.   The  values  read  out  of  the  profile  memory  will  be 
taken  off  the  M-bus  by  the  pattern  unit  and  stored  at  the  same  address 
in  the  pattern  unit  memory. 

Once  the  counter  has  passed  its  maximum  value,  127,  the  DONE^ 
flipflop  is  set  and  the  store  operation  is  complete. 
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Figure  U  .  11 :   State  J+  Control 
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h. 8    Additional  Comments 

The  system  is  constructed  on  10  x  ik   cm  wirewrap  boards  which 
have  space  for  up  to  2h   l6-pin  IC  packages.   Each  board  has  hk 
backplane  connectors.   As  far  as  is  possible,  each  architectural 
component  (as  in  Figure  k.2)    is  constructed  on  a  single  board.   However, 
in  some  cases  this  cannot  be  done  because  of  a  limitation  in  board 
space  or  number  of  connectors.   A  list  of  the  boards  and  a  functional 
description  is  supplied  in  Appendix  k.      The  front  panel  and  operating 
procedures  are  described  in  Appendix  5« 

The  system  does  not  have  its  own  power  supply  or  system  clock. 
These  are  to  be  supplied  from  the  intelligent  camera.   The  requirements 
are  approximately  8A  at  5V  for  the  power  supply,  and  a  3.58  MHz  clock. 
This  frequency  is  chosen  since  it  corresponds  to  the  subcarrier  frequency 
of  the  camera. 

Standard  TTL  logic  is  used  for  the  most  part,  mixed  with  some 
low-power  Schottky  TTL,  subject  to  availability.   The  boards  are  designed 
so  that  each  module  will  place  at  most  a  single  TTL  load  on  a  bus. 
Access  on  to  a  bus  is  always  controlled  by  tristate  bus  drivers  which 
are  enabled  by  the  associated  STATE  signal.   This  ensures  that  at  no 
time  do  two  different  modules  attempt  to  control  the  same  bus. 

The  number  of  clock  cycles  required  to  perform  a  matching 
operation  can  be  calculated  from  the  operation  of  the  relevant  circuit 
modules,  states  1,  2  and  3.   Thus,  state  1,  which  examines  the  contents 
of  a  ^096-bit  memory  twice,  requires  8192  clock  cycles. 
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Each  element  comparison  requires  67  clock  cycles,  as  may  be 
seen  in  Figure  k.8.      Examination  of  an  entire  profile  requires  6k 
element  comparisons  plus  two  extra  clock  cycles — U290  clock  cycles 
in  total.   This  must  he  repeated  for  Gh   possible  rotations,  plus  one 
circular  profile  comparison.   Also,  sixteen  clock  cycles  are  used  for 
a  reset  period.   Therefore,  state  2  requires  278,866  clock  cycles 
in  all. 

State  3  requires  one  clock  cycle  for  each  stored  profile.   In 
comparison  with  state  2,  this  is  an  infinitesimal  amount  which  may 
be  ignored.   The  total  number  of  clock  cycles  required  for  comparison 
is  therefore  computed  to  be  the  sum  of  states  1  and  2,  approximately 
287,000. 

Hence,  to  meet  the  design  goal  of  a  recognition  time  of  one 
second,  a  clock  speed  of  287  KHz  is  sufficient.   At  a  clock  rate  of 
3.58  MHz,  recognition  will  be  accomplished  in  approximately  80  milli- 
seconds.  This  is  slightly  less  than  the  time  required  to  scan  three 
full  video  frames  at  the  North  American  scan  rate  of  30  frames  per 
second.   Thus,  recognition  is  performed  on-line  at  speeds  commensurate 
with  video  signals. 
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5.   DISCUSSION  OF  RESULTS 

This  thesis  has  demonstrated  the  validity  of  the  propositions 
in  Chapters  1  and  2  concerning  on-line  image  recognition  by  dedicated 
hardware  and  the  radial  profile  technique. 

The  simulation  results  demonstrate  under  controlled  conditions 
that  representing  an  object  by  its  radial  and  circular  profiles  is  a 
useful  technique  for  the  removal  of  redundant  information,  and  one  which 
provides  rotational  invariance  in  a  simple  manner.   An  easily  computed 
comparison  measure  for  the  profiles  has  been  developed  and  demonstrated 
not  only  to  be  valid,  but  also  to  be  tolerant  of  noise  and  inaccuracies 
in  centering  and  size  normalization.   No  errors  were  encountered  until 
the  image  degradation  exceeded  10$,  and  performance  deteriorated  gracefully 
thereafter.   Furthermore,  the  measure  was  shown  to  provide  by  its 
magnitude  a  built-in  estimate  of  the  accuracy  of  the  classification. 

The  hardware  implementation  of  the  algorithm  demonstrates  that 
dedicated  hardware  can  provide  an  effective,  low-cost  solution  to  the 
image  recognition  problem  in  practical  applications.   The  system 
exploits  parallel  operation  to  achieve  recognition  in  approximately 
80  ms ,  irrespective  of  the  number  of  objects  in  the  training  set.   This 
is  over  an  order  of  magnitude  faster  than  the  design  goal  of  one  second 
recognition  time,  and  very  close  to  commercial  video  rates.   In  contrast, 
the  time  taken  by  a  dedicated  large  mainframe  to  perform  the  same  task 
has  been  measured  at  260  ms  for  each  object  in  the  training  set. 
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The  amount  of  hardware  required  to  perform  this  task  is  small 
enough  that  it  may  be  carried  without  difficulty  by  one  person.   In 
view  of  the  difference  in  cost  between  this  and  the  large,  general 
purpose  mainframe  whose  performance  it  exceeds,  it  is  clear  that 
dedicated  hardware  can  provide  a  more  efficient  solution  to  the  problem. 
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1.   Simulation  Commands 

The  simulation  program  is  written  in  PL/1  and  is  707  statements 
in  length.   The  input  to  the  program  consists  of  user  commands  which 
control  the  operation  of  the  simulation.   The  input  format  is  free. 
However,  because  of  the  input  conventions  of  PL/1,  alphabetic  commands 
must  be  surrounded  by  single  quotation  marks. 

The  first  four  data  items  must  be  integers  specifying  in  order 

the  following  parameters  which  are  used  to  define  storage  areas: 

the  number  of  bits  on  a  side  of  the  rectangular  image 
(which  is  square) 

the  number  of  points  on  a  radius 

the  number  of  radii 

the  number  of  profiles  to  be  stored 

Images  are  built  up  out  of  component  objects  using  rectangular 
coordinates  within  the  unit  square  by  means  of  the  '  CR'  ("create") 
command.   An  object  may  be  a  rectangle,  right  angled  triangle  or 
circle.   Except  for  circles,  objects  are  constrained  to  lie  completely 
within  the  unit  square.   While  not  all  possible  images  may  easily  be 
created  using  these  resources,  a  sufficiently  rich  repertoire  for 
experimental  purposes  is  conveniently  available.   The  format  is 

'CR»  obj  obj  ...  50 
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The  list  of  "obj"s  may  be  one  or  more  in  length.   An  "obj"  may  be  any 
one  of  the  folio-wing: 

Xl  yl  X2  y2     A  rectanSle  witn  corners  (x  ,  y  )  and  (x„,  y^). 

10  x  y  x~  y   A  right  angled  triangle  with  corners  (x  ,  y  ) , 
(x2,y1)  and  (x^  y3). 

20  x  y  r      A  circle  with  center  (x  ,  y  )  and  radius  r. 

30  Indicates  that  all  following  objects  will  be 

white  instead  of  black,  which  is  the  default. 
White  objects  erase  any  portions  of  a  previously 
defined  black  object  with  which  they  intersect. 

1+0  Indicates  that  all  following  objects  will  be  black. 

50  Indicates  the  end  of  the  list  of  objects. 

(x,  y  and  r  are  real  numbers.) 

The  command  'PI'  ("print  image")  will  print  the  most  recently 
defined  image.  Each  '  CR'  command  automatically  blanks  out  the  image 
storage  before  cor..   acting  the  new  imaf.  . 

To  create  a  radial  image  from  the  most  recently  defined  rectangular 
image,  the  command  is 

•SA'  6  ("sample") 

which  causes  radial  sampling  to  be  done  at  a  rotation  angle  of   0  degrees, 
where  6   is  a  real  number.   Sampling  is  done  within  the  unit  circle 
only.   The  'PR'  ("print  radial")  command  prints  the  radial  image  with 
the  radii  in  their  sampled  position.   The  "PS'  ("print  scan")  command 
prints  the  radii  side  by  side,  as  they  are  stored. 
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To  construct  the  two  profiles  of  the  radial  image,  the  command 
format  is 

'SP'  n  ("store  profile") 

where  n  is  an  integer.   The  profiles  will  be  stored  by  the  profile 
number  n.   They  may  be  printed  by  the  command 

'PP'  n  ("print  profile") 

For  permanent  archiving  of  complicated  images,  the  'PU'  ("punch") 
command  is  provided.  This  punches  the  radial  image  out  on  cards.  Since 
all  images  defined  so  far  have  consisted  of  less  than  twenty  elements, 
it  has  been  convenient  to  retain  the  cards  containing  the  ' CR1  statement 
for  the  image.  Thus,  there  has  been  as  yet  no  need  to  bother  to  program 
the  opposite  number  of  the  'PU'  command  to  read  the  radial  image  back 
in.   This  would  be  a  trivial  task  to  add,  should  it  ever  prove  necessary. 

For  testing  it  is  necessary  to  introduce  inaccuracies  into  the 
images.   The  commands  provided  for  this  purpose  are: 

'TR'  x  y   ("translate")  The  rectangular  image  is  translated  in 
the  x  and  y  directions  by  the  distances  given 
(which  may  be  zero).   Anything  shifted  out  of  the 
unit  square  is  lost. 

'SH'  f    ("shrink")  This  approximates  shrinking  of  the  radial 
image  to  a  fraction  f  of  its  original  size  by 
shifting  the  points  on  the  radii  toward  the  center. 
Points  shifted  into  the  center  point  are  lost. 

'EX'  f    ("expand")  This  approximates  expansion  of  the  radial 

image  to  f  times  its  former  size  by  shifting  the  points 
on  the  radii  outward  from  the  center.   The  center  point 
supplies  new  values. 
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1 NS'  p  ("noise")  This  inserts  pseudorandom  noise  into  the 
radial  image,  p  is  the  probability  that  each  bit 
will  be  inverted. 

Two  commands  are  supplied  for  performing  comparisons. 

'CO*  n  n2   ("compare") 

will  perform  a  comparison  between  the  two  sets  of  stored  profiles  n 
and  n~.   The  command  'MA'  ("match  all")  will  compare  all  combinations 
of  stored  profiles  and  print  the  output  in  three  triangular  matrices 
showing  the  best  radial  match,  the  circular  match  and  the  sum  of  the 
two. 

The  output  of  the  various  print  commands  described  above  is 
produced  on  the  line  printer.   While  this  provides  timely  and  informative 
output,  it  is  sometimes  desireable  to  have  more  aesthetically  pleasing 
images.   Therefore,  a  set  of  routines  was  added  which  produces  output 
on  the  Calcomp  plotter.   These  routines  have  their  own  command  set  which 
is  invoked  by  the   PL'  ("plot")  command.   All  plots  are  approximately  15cm 
square.   The  plot  command  set  consists  of  the  following: 

'RECT'    Plot  the  current  rectangular  image.   Each  image  point 
is  represented  by  a  grid  square  around  the  point. 

'SAMP'    Plot  the  currently  stored  radial  image  in  position. 
Each  image  point  is  represented  by  an  arc  over  the 
angular  sampling  interval. 

'POLR'    Plot  a  graph  of  the  stored  radii  of  the  radial  image. 

'PROF'  n  Plot  a  graph  of  the  radial  profile  and  of  the  circular 
profile  for  profile  number  n. 
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EXIT'    Return  to  the  main  command  set, 


The  results  of  these  plot  routines  may  be  seen  in  Chapter  2  and  in 
Appendix  2. 
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2.   Objects  Used  in  Simulation  Experiments 
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Figure  A. 1:      Calendering 
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Figure  A. 2;      Casting 
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Figure  A. 3:   Compression  Molding 
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Figure  A. 4:      Extrusion 
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Figure  A. 5:   Injection  Molding 
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Figure  A. 6 :      Laminating 
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Figure  A. 7:   Material  Forming 
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Figure  A. 8:   Material  Removal 
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Figure  A. 9:   Rotational  Casting 
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Figure  A. 10 :      Thermoforming 
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3.   Simulation  Results 

The  tables  in  this  appendix  summarize  the  results  of  the  simula- 
tion experiments.   For  each  experimental  run  statistics,  consisting 
of  the  mean,  standard  deviation  and  extrema,  are  given  which  describe 
the  match  values  obtained  by  matching  the  original  training  set  to  a 
test  set  which  is  derived  from  it  by  the  introduction  of  inaccuracies 
of  differing  types  and  amounts.   Results  are  given  for  self-match,  that 
is,  when  objects  in  the  training  set  are  compared  to  the  corresponding 
objects  in  the  test  set,  and  match  with  others,  when  they  are  compared 
to  the  other  objects  in  the  test  set. 

For  each  table  entry,  three  values  are  given.   These  are 
respectively  the  radial,  circular  and  combined  values.   The  match  value 
is  expressed  as  a  percentage  of  the  best  possible  value. 
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4.   System  Circuit  Boards 


LOWER  RACK 


LED  Driver 
State  Sequencer 
Input 

Scan  Memory 

Profile  Memory 


State  2  Control,  Card  1 

State  2  Control,  Card  2 

State  3  Control,  Card  1 

State  3  Control,  Card  2 

State  k  Control,  Card  1 

State  k  Control,  Card  2 


UPPER  RACK 


Bus  Termination 
Threshold  Value  Switches 

Pattern  Unit  0 


Drive  circuits  for  LEDs  on  front  panel 

As  described  in  Section  k .2 

Generates  test  data  in  lieu  of  intelligent 
camera 

Contains  a  UKxl  bit  static  RAM  to  store 
the  input  image 

Contains  two  256xU  "bit  static  RAMs  to  store 
the  input  profiles 

As  described  in  Section  k . 5 
As  described  in  Section  k.6 
As  described  in  Section  k .7 


Contains  pullup  resistors  for  bus  termination 

Contains  switches  to  set  comparison  threshold 
value 

As  described  in  Section  h  .k 


Pattern  Unit  7 
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5.   Front  Panel  and  Operating  Procedures 

The  operating  procedures  in  this  appendix  refer  to  the  front 
panel  of  the  OLREC  system,  which  is  shown  below.  Before  commencing 
it  should  he  ensured  that  a  5V  power  supply  and  a  clock  signal  have 
been  connected. 

When  the  ON/OFF  switch  is  placed  in  the  ON  position,  the  PWR  LED 
will  light  up  and  power  will  be  applied  to  the  system.   Pressing  the 
RESET  switch  will  place  the  system  in  the  READY  state  and  clear  the 
MATCH  COMPLETE,  BEST  MATCH  and  MATCH  VALUE  LEDs. 

If  the  START  signal  is  to  be  supplied  externally,  the  START 
SOURCE  switch  should  be  in  the  EXT  position.   If  it  is  in  the  MAN 
position,  the  START  signal  is  supplied  by  pressing  the  MAN  START  switch. 

When  the  START  signal  is  given,  the  system  will  cycle  through 
the  PROFILE,  COMPARE  and  MATCH  states.   The  cycle  is  shown  on  the 
corresponding  LEDs.   The  MATCH  COMPLETE  LED  will  turn  off  at  the 
beginning  of  the  cycle  and  come  on  again  at  the  end.   This  LED  indicates 
the  validity  of  the  match  data  displayed. 

The  input  image  will  be  compared  to  all  stored  profiles  up  to 
and  including  the  number  set  on  the  NO.  OF  PROFILES  switches.   If 
profiles  have  not  been  stored  at  these  addresses,  the  results  will 
be  invalid.   At  the  end  of  the  comparison  cycle,  the  BEST  MATCH  LEDs 
will  display  the  number  of  the  stored  profile  data  which  best  match 
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the  input  image.   The  MATCH  VALUE  LEDs  display  the  match  value 
obtained. 

To  store  the  input  profiles,  the  number  of  the  pattern  unit 
where  the  data  are  to  be  stored  should  be  set  up  on  the  STORE 
ADDRESS  switches.   When  the  STORE  switch  is  pressed,  the  data  will  be 
stored.   This  is  shown  by  the  STORE  LED  turning  on.   When  the  store 
process  is  complete,  the  system  returns  to  the  READY  state. 

If  the  store  data  do  not  replace  existing  data  in  a  pattern 
unit,  it  should  be  remembered  that  the  NO.  OF  PROFILES  should  be 
increased  by  one  to  include  the  newly  stored  profiles  in  future 
comparisons. 
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